Amharic-English Information Retrieval

نویسندگان

Atelach Alemu Argaw

Lars Asker

چکیده

We describe Amharic-English cross lingual information retrieval experiments in the adhoc bilingual tracs of the CLEF 2006. The query analysis is supported by morphological analysis and part of speech tagging while we used different machine readable dictionaries for term lookup in the translation process. Out of dictionary terms were handled using fuzzy matching and Lucene[4] was used for indexing and searching. Four experiments that differed in terms of utilized fields in the topic set, fuzzy matching, and term weighting, were conducted. The results obtained are reported and discussed.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Dictionary-based Amharic - English Information Retrieval

We present two approaches to the Amharic – English bilingual track in CLEF 2004. Both experiments use a dictionary based approach to translate the Amharic queries into English Bags-of-words, but while one approach removes non-content bearing words from the Amharic queries based on their IDF value, the other uses a list of English stop words to perform the same task. The resulting translated (En...

متن کامل

Amharic-English Information Retrieval with Pseudo Relevance Feedback

We describe cross language retrieval experiments using Amharic queries and English language document collection from our participation in the bilingual ad hoc track at the CLEF 2007. Two monolingual and eight bilingual runs were submitted. The bilingual experiments designed varied in terms of usage of long and short queries, presence of pseudo relevance feedback (PRF), and three approaches (max...

متن کامل

Dictionary Based Amharic-arabic Cross Language Information Retrieval

The demand for multilingual information is becoming perceptive as the users of the internet throughout the world are escalating and it creates a problem of retrieving documents in one language by specifying query in another language. This increasing demand can be addressed by designing automatic tools, which accepts the query in one language and retrieves the relevant documents in other languag...

متن کامل

The effectiveness of stemming for information retrieval in Amharic

Amharic is an example of a language with a very rich morphology, which means that systems for searching Amharic text databases can be effective in operation only if full account is taken of the many word variants that may occur. This paper reports a comparison of stem-based and root-based searching of Amharic text with conventional, word-based searching. The results demonstrate the effectivenes...

متن کامل

Mining a News Archive for a Comparable Corpus

The emergence of the World Wide Web has provided new and important opportunities to easily access and combine data and information from several different sources and thereby enabling the construction of new resources for researchers everywhere. In this paper we describe recent work that we have done in order to construct a comparable corpus consisting of Amharic and English news texts. In doing...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Amharic-English Information Retrieval

نویسندگان

چکیده

منابع مشابه

Dictionary-based Amharic - English Information Retrieval

Amharic-English Information Retrieval with Pseudo Relevance Feedback

Dictionary Based Amharic-arabic Cross Language Information Retrieval

The effectiveness of stemming for information retrieval in Amharic

Mining a News Archive for a Comparable Corpus

عنوان ژورنال:

اشتراک گذاری